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Predictive implications of Gompertz’s law. 


Peter Richmond^ and Bertrand M. Roehner^ 

Abstract Gompertz’s law tells us that for humans above the age of 35 the death 
rate increases exponentially with a doubling time of about 10 years. Here, we show 
that the same law continues to hold even for ages over 100. Beyond 106 there is 
so far no statistical evidence available because the number of survivors is too small 
even in the largest nations. However assuming that Gompertz’s law continues to 
hold beyond 106, we conclude that the mortality rate becomes equal to 1 at age 
120 (meaning that there are 1,000 deaths in a population of one thousand). In other 
words, the upper bound of human life is near 120. The existence of this fixed-point 
has interesting implications. It allows us to predict the form of the relationship be¬ 
tween death rates at age 35 and the doubling time of Gompertz’s law. In order to 
test this prediction, we first carry out a transversal analysis for a sample of countries 
comprising both industrialized and developing nations. As further confirmation, we 
also develop a longitudinal analysis using historical data over a time period of almost 
two centuries. Another prediction arising from this fixed-point model, is that, above 
a given population threshold, the lifespan of the oldest person is independent of the 
size of her national community. This prediction is supported by available empirical 
evidence. 


Version of 25 September 2015. 

Key-words: death rate, Gompertz’s law, oldest persons, transversal analysis, longitudinal analysis. 

1: School of Physics, Trinity College Dublin, Ireland. Email: peter_riehmond@ymail.eom 
2: Institute for Theoretical and High Energy Physics (EPTHE), University Pierre and Marie Curie, 
Paris, Eranee. Email: roehner@lpthe.jussieu.fr 




2 

Introduction 

Among social phenomena there are very few that are governed by laws which are 
valid with good accuracy in all times and all countries. Gompertz’s law is one of 
them. For a physicist Gompertz’s law is fairly unusual because it is an exponential 
change whose rate itself changes in an exponential way. One will not be surprised 
that such a process reaches a critical point within a finite time. The present paper 
draws several implications from this observationQ. 

In 1825 Benjamin Gompertz (1779-1865) derived the law named after him from 
life tables for the cities of Carlisle and Northampton. Gompertz’s law states that for 
ages over ti = 35, the mortality rate /i(f) = {l/s)ds/dt, where s{t) denotes the 
population of a cohort in the course of time, increases in an exponential manner: 

fi{t) = di exp [a{t — ti)] di : death rate at age A ~ 35 years 

If this law also holds in old age, an obvious implication is that the extinction of a 
population does not occur asymptotically, but within a finite time interval. This is a 
consequence of the fact that if for an age t 2 the death rate = (1/s) [As/Af] 
in a unit time interval At = 1 reaches 1, then the number of deaths As equals the 
population s{t) which means that the population vanishes for t = t 2 . In other words, 
t 2 represents the strict upper bound of the population’s life time. Just to show that 
this is not purely theoretical, we note that for the population considered in the study 
by Arthur Roger Thatcher and his collaborators (1999), 79% of the males aged 109 
died before reaching 110. 

The data for Gompertz’s law up to age 106 are summarized in Fig. la. The data 
come from two sources. For ages up to 80 (or sometimes 95) the death rates can 
be taken from standard death rate statistics as published in all countries. For ages 
up to 106, the data are taken from Thatcher (1999) based on a global sample of 13 
countries over several years. The monograph actually gives death rates up to age 
113, but beyond age 106 the numbers involved become very small which gives rise 
to substantial random fluctuations. 

Incidentally, such fluctuations explain why studies using national death rate data for 
persons over 100 lead to fairly shaky results. Some studies (e.g. Strehler 1967) 
display a deceleration effect, whereas in others (e.g. the graph for the United States 
in 2003 that accompanies the Wikipedia article entitled “Gompertz-Makeham law of 
mortality”) there appears to be a death rate acceleration. For the age interval 80-106 
Thatcher et al. (1999) show clearly that Gompertz’s law continues to hold. Beyond 
106 there is still no convincing evidence. One will have to wait until studies similar 

'The present paper is the third in a comparative biodemographic investigation which, so far, comprised the following 
steps: Richmond and Roehner (2015 a,b). 
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to the Thatcher study are performed in countries with large populations such as China 
or India. 




Fig. la,b Gompertz’s law. Left: Gompertz’s law is characterized by a death rate which increases exponen¬ 
tially with age: ^{t) = di exp(Q;(t — ti)). Here ti is about 35 and for t > ti one gets a = 0.074 which 
corresponds to a doubling time of 6 = log 2/a = 9.4 years. The empirical part of the death rate curve relies 
on two different sources: (i) first, standard vital statistics; usually such data do not go beyond 95. (ii) the data 
collected by Arthur Thatcher and his collaborators had for objective to cover ages beyond 95. Yet, beyond the 
age of 107, the samples become too small even for very large initial populations. That is why, for ages between 
107 and 120, we make the conjecture that the line can be extrapolated. This extrapolation leads to the fact that 
approximately at age 120 the death rate reaches the value 1,000 per year and per 1,000 persons which means 
that the population vanishes. This upper bound of 120 agrees fairly well with verified (worldwide) maximum 
lifespans: Jeanne Calment (122) and Sai'ah Knauss (119). Right: This graph is a schematic illustration of the 
fixed-point model. The three Gompertz lines are supposed to correspond to different countries and time periods. 
Fig. 5a displays a graph of that kind that is based on real data. Sources: Strehler (1967), Thatcher et al. (1999) 


Fig. lb illustrates the simple idea upon which the present study is based. We refer to 
it as the “fixed-point model”. 

If over age f = 35 Gompertz’s law holds for all countries and if 120 years is the 
end point of human life, then the equation of any straight line which summarizes 
Gompertz’s law for a specific country will be determined by the initial death rate 
from which it starts. If one denotes the logarithm of the death rate by y, according to 
a standard formula, this equation reads: 

/ N V2 — Vi hVi — tiV2 

y = a{t - ti) + q, a = - -—, q = — - - - y = log (death rate), t = age 

12 — ti 12 — ti 

For the initial age G we selected an age such that in any country Gompertz’s law 
holds for t > ti. Observation shows (see Fig. 2a,b) that this is the case for the age 
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intervals above (and including) 35 — 39. This leads to a value of ti equal to the mid¬ 
point of this interval, i.e. ti = 37.5 years. Then , replacing ti, t 2 by 37.5 and 120, 
and 1/2 by log 1000, one gets the following relationship between the exponent a and 
logdi. 

a = —^— logdi + _ _Q Q]^21 logdi + 0.084 (1) 

t2 — h t2 — h 

In the numerical form of this relationship it is supposed that a is expressed in year“^ 
and di in number of deaths per year and per 1,000 population. 

Gompertz’s exponent’s changes and the filter effect 

Transversal analysis 

In order to study the variability of Gompertz’s law across countries, we need to set 
up an appropriate “experiment”. In other words we need to dehne the kind of data 
that we need. 

What data do we need? 

Firstly, because we need data for several countries we must use a source provid¬ 
ing such international data. The Demographic Yearbooks published by the United 
Nations serve our needs well. 

Secondly, in order to be able to deduce the relationship between di and a we need 
observations for a wide range of di. We cannot restrict ourselves to data from indus¬ 
trialized countries because in all these countries di will be limited to a narrow range 
of small values, basically of the order of 1 per 1,000. 

Moreover, since we wish to study the convergence of death rates toward the fixed- 
point at age 120, we would like death rates for age groups which are as close as 
possible to 120. Since at this point we are not particularly interested in the difference 
between male and females we shall use data for both sexes. 

Now that we have defined what we need, let us see what the UN Yearbooks can offer. 
By accessing the Yearbook of 2011 one observes immediately two things, 

• While numbers of deaths by age are given for many countries, death rates are 
given for only about one third of the countries. 

• In the subset of countries for which rates are available only about one third 
of them provide data up to the age group 95-99. For the others, the data are given 
only up to 70, 80 or 85 depending on the country. This will lead us to distinguish 
two groups of countries: group A which contains all countries providing data up 
to age-group 95-99 (such data will be referred to as “complete data”) and group B 
which will contain all other countries (it will be referred to as the “incomplete data” 
group). 
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• In group A there are countries with death rates at age 37.5 that are as low as 0.6 
per 1,000. At the other end of the spectrum one would expect to find high death rates 
in developing countries and particularly in African countries. However, among the 
African countries with sizable populations there are only 4 for which death rates are 
given, namely: Egypt, Sierra Leone, Swaziland and Zimbabwe. Egypt was included 
in group B but its death rate at age 37.5 is di = 1.7 which is not particularly high. 
In the other three countries di = 11.3, 36.3, 38.9. Such values would be quite 
useful but unfortunately the death rate series do not appear reliable. This can be seen 
in two ways. Eirst, by the fact that they are labeled by the UN as I (instead of C 
which means “complete”) and secondly because the death rates do not increase in a 
monotonic way. Thus, for Swaziland the death rates in the age groups 70 — 74 and 
75 — 79 are 48.0 and 45.4 respectively which, although not altogether impossible, is 
highly unlikely. 

In conclusion, we expect reliable results in group A but fairly poor results in group 

B. 

Results 

Eor each and every country, Gompertz’s law holds with high accuracy (see Eig. 2a,b). 
More precisely, when reliable statistics are used the correlation (age, logarithm of 
death rate) is always higher than 0.995. Yet, the slopes may differ substantially. This 
leads to the two following questions. 

(1) Is there a regularity in the variations of the slopes or is it just random? 

(2) If there is a regularity does it follow the fixed-point model? 

(3) If the data are indeed well described by the fixed-point model, how can one 
explain this effect in biological terms? 

Eig. 3a,b answers the questions 1 and 2. 

More precisely one gets the results summarized in Table 1. 

Longitudinal analysis 

Western countries began to collect reliable demographic statistics in the mid-19th 
century. As death rates at age 35-39 were substantial higher at that time can we 
possibly use such data to explore the region of high di that was out of reach in our 
cross-national analysis? As a test-case we consider Erance. In INSEE (1966) one 
can find death rate data by age-group from 1806 on. 

1806 : di = 14.8, 1836 : di = 11.8, 1866 : di = 11.3, 2005 : di = 1.1 

As the data for 1806 may be somewhat less reliable than later ones, we selected: 
1836, 1866 and 2005. Our purpose in taking 1836 and 1866 in spite of the fact that 
they have almost the same di is to control the reliability of the data. 

This longitudinal analysis leads to the results summarized in Eig. 5a,b. 
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Fig. 2a,b Gompertz’s law in two sets of countries. Left: Group (A) of countries which give death rates for 
ages up to 95-99 year old. Right: Group (B ) of countries which provide death rate data only for shorter age 
intervals. In Fig. 3a,b and 4a,b complete and incomplete data will also be on the left and right respectively. 
Ideally all lines are expected to converge toward the end-point (120,1000). Not surprisingly, this convergence 
is clearer in group A than in group B. Source: United Nations Demographic Yearbook, 2011. 
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Fig. 3a,b Exponents of Gompertz’s law as a function of death rates in the 35 — 39 age-group (log d{). 
Left: Group A of countries with complete datasets. The correlation (log di, a) is —0.970. Right: Group B of 
countries with incomplete datasets. The correlation is —0.55. Although the slopes of the regression lines are 
almost the same in the two groups, the accuracy of their measurement is about 5 times better in group A than 
in group B. The exponents predicted by the fixed-point model are within the error bars of the observations. 
Source: United Nations demographic yearbook, 2011. 


Interpretation 

The simplest interpretation which comes to mind for the results described in the 
previous subsection relies on a filter effect. There are two steps in this explanation. 
First, we note that the death rate at age 37.5 is not an isolated number but is related 
to the rates in other age intervals. Thus, for a sample of countries the infant mortality 
rate d{0 — 1) and d{35 — 39) have a cross-correlation of the order of 0.7. Similarly, 
d{35 — 39) has a correlation of same magnitude with d{l — 4). This means that a 
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Death rate at age 35-39 


Death rate at age 35-39 (log scale) 


Fig. 4a,b Doubling times 0 of death rates as a function of death rates in the initial 35—39 age-group (di). 
Left: Group A of countries with complete datasets. Right: Group B of countries with incomplete datasets. 
According to the fixed-point model there should be an hyperbolic relationship between the two variables: 9 = 
log 2/(—a log di + b). The non linearity of the relationship would become more obvious for higher initial death 
rates. In some African countries (e.g. Sierra Leone or Zimbabwe) initial death rates as high as 30 per 1,000 
were reported but the demographic data of such countries appear' fairly uru'eliable (see text). The names of 
North Korea and Pakistan are written in red because the data provided by these countries are acknowledged 
to be incomplete (even for the restricted age range for which data are available). Source: United Nations 
Demographic Yearbook, 2011. 


Table 1 Relationship between initial death rates di and Gompertz’s exponents: a = —alogdi + h 



100 X a 

100 X b 

Prediction of the fixed-point model 

1.21 

8.37 

Group A (complete data) 

Difference with respect to prediction 

1.33 ±0.20 
10% 

9.54 ±0.13 

14% 

Group B (incomplete data) 

Difference with respect to prediction 

1.34 ± 1.0 
11% 

9.37 ±0.41 

12% 

Notes: For clarity the table gives o, h multiplied by one hundred. Comparison of the error bars (probability 
level of 95%) shows that for a as well as b the accuracy of the observations in group A is five fimes beffer fhan 
in group B. 


high (i(35 — 39) was preceded by a range of fairly high death rates at a younger age. 
It should be observed that intuitively such correlations are rather unexpected since 
for for such age intervals the causes of death are fairly different. For the young, say 
for age below 5, the main causes of death are diseases whereas for the age group 35- 
39 the causes of death are mostly external factors, e.g. accident, suicide, homicide. 
The fact that there is nevertheless a correlation is probably related to the nature of 
the social organization. Low infant mortality implies a rich country which in turn 
implies well organized transports (few traffic accidents) and little violence. 

Now, the meaning of the filter effect becomes clear. In a society characterized by a 
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Fig. 5a,b Gompertz’s law in France in different years. Left: The dotted straight lines beyond 95 are the 
regression lines. Ideally one expects all lines to converge toward the end-point (120,1000). In fact, the center of 
the red circle around the convergence point is somewhat higher than 1,000. The fact that the lines for 1836 and 
1866 are almost identical controls the reliability of the data. Right: Regression line for (di, a) and comparison 
with the predicted slope. Source: 2005: United Nations demographic yearbook, 2011; 1836, 1866: INSEE 
(1966) 


high d{35 — 39) there is also a high infant mortality which means that persons who 
for diverse reasons are “fragile” (e.g. low immunity, heart malformation) will die 
in the early years of their lives. It follows that the survivors will be less susceptible 
to the diseases which usually come with age which means a death rate which will 
increase fairly slowly with age. 

Wall effect for the high end of the lifespan distribution 

The wall effect was already illustrated in Fig. 1 by the fact that the survivorship func¬ 
tion becomes strictly equal to zero for age t = 120. Here, however, the implication 
will be stated in the language of probability theory. The survivorship curve describes 
the population of a cohort in the course of time. If the initial population is normalized 
to 1, this curve becomes equivalent to the probability for an individual to reach age 
t. In other words it is: P{X > t} = 1 — F{t) = G{t), where F{t) is the cumulative 
probability distribution function and G{t) the complementary distribution function. 
Thus, f{t) = —dG/dt where f{t)dt = F{t < X < t + dt} is the probability density 
function. It gives the number of people in a cohort whose lifespan is in the interval 
{t, t F dt). The function G{t) for the lifespan in France is given in Fig. 6a. 

The fact that: /(f) = 0 for any f > 120 has two consequences. 

• Maximum values of lifespan will be squeezed within a short interval ending at 
120. For the purpose of comparison Fig. 6b shows also maximum values for height. 
We chose this variable because it is usually considered as the archetype of a Gaussian 
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Fig. 6a,b Comparison of extremes values of height and age. Left: Density function of the duration of 
human life (both sexes together). The survival curve s{t) is identical to the complementary distribution func¬ 
tion: s{t) = 1 — F{t). Thus, the density function is the opposite of the derivative of the survivorship curve: 
f{t) = dF/dt = —ds/dt. The death rate curve is the opposite of the derivative of the logarithm of the sur¬ 
vivorship curve: /r(f) = {l/s)ds/dt == —d\og[s{t)\/dt. Right: Comparison between 80 top values for height 
(men) and length of life (women). The vertical lines show the differences (expressed in number of a) between 
individual top values and the mode (i.e. peak value) of the distribution. In each case the standard deviation a 
was computed from the values in the vicinity of the peak. The wall effect for maximum ages is quite apparent. 
Note that for height the small number of vertical lines is due to the fact that many realizations correspond to 
identical positions because heights are expressed in centimeters rather than in millimeters. Incidentally, it can 
be noted that the extreme values of height do not follow a Gaussian distribution for in a real Gaussian (with 
here <7 ~ 7 cm) existence of heights of 230cm (i.e. T.Ga) would require a population of 1/G(7.6) = 10^^ 
people. The same observation can be made regarding the existence of dwarfs. Source: Survivorship curve: 
Official mortality table (TD73-77) used by insurance companies in France. Extreme values: Wikipedia lists of 
tallest and oldest persons. 


distribution. Incidentally, while indeed true for heights within 2 or 3 cr around the 
mean this is no longer true for the tails of the distribution. 

• Extreme lifespan values are independent of sample size. For any random vari¬ 
able whose distribution function G{t) tends toward 0 (yet without reaching 0) as t 
increases, the largest realizations are conditioned by the size of the sample. Thus, if 
for the height (expressed in cm) one has G(200) = 10“^ one will need a sample of at 
least 1,000 individuals in order to have a chance to get one person with a height over 
200 cm. With this logic (and leaving aside genetic factors) the height of the tallest 
persons will be larger in the United States than in Iceland whose population is 1,000 
times smaller. On the contrary, for age, above an appropriate threshold po, the size 
of the sample will not play any role. This prediction of the fixed-point model is easy 
to check. Tables of oldest persons by country can be found on Wikipedia. Here are 
some cases. The first number gives the population P in millions, the second the age 
t of the oldest person. 
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Belgium: 11, 112.5; Denmark: 5.6, 115.7; France: 66, 122.4 Germany: 80, 115.2; 
Iceland: 0.33, 109.8; Ireland: 4.6, 113.4; Italy: 60, 115.7; Moldova: 3.5, 114.8; 
Russia: 143, 113.0; United States: 319, 119.3. 

There is a correlation of about 0.70 between log P and t but it vanishes for f > 10 
million which means that ~ 10 million. 

Conclusion 

Extension of the fixed point model to other species 

The fixed-point model developed in this article is based on two observations. 

(1) The validity of Gompertz’s law for ages over 40. 

(2) The fact that the upper bound of human life is about 120 years. 



Fig. 7 Age-specific mortality rates for humans and drosophila. Gompertz’s law holds in both cases. As 
seen at the beginning of the paper, the leveling off observed in humans should be attributed to statistical fluctua¬ 
tions due to the small numbers of the survivors. For the leveling off of the drosophila curve we do not yet know 
whether it is spurious or real. Sources: Strehler (1967); Miyo and Charlesworth (2004), Wang et al. (2013). 

So far, the second point remains a conjecture. It was tested through its consequences 
but direct verification will become possible once reliable become available for pop¬ 
ulous Asian countries. Needless to say, it is the birth dates which pose a challenge 
because for present-day centenarians their birth took place in a time where vital sta¬ 
tistical records may not have been completely accurate. 

Naturally, the analysis developed here for humans can be extended to any other 
species for which (i) Gompertz’s law holds and (ii) an upper bound of the life span 
can be demonstrated. As an illustration let us consider the drosophila fruit flies. As 
shown in Fig. 7, their age-specific mortality rates follow Gompertz’s law. 




11 


One may wonder whether the leveling off seen in the drosophila curve is real or 
rather due to statistical fluctuations. There is no doubt that the question could be 
settled by performing repeated observations on large populations. Anyway, it should 
be observed that there can be an upper bound of the life span even if there is a leveling 
off. Only a steady decrease of /i(f) would prevent it from reaching the terminal rate 
of 1,000 per 1,000. 

What can be expected from the observation of populations of drosophila? The flrst 
question is of course to see whether there is a flxed point or not and if there is one 
how robust it is with respect to changes in the parameters such as temperature, light, 
male/female ratio which shape the living conditions of the drosophila. It is obvious 
that if living conditions are not appropriate the flxed-end point will remain out of 
reach. On the contrary, conditions under which some members of a cohort will 
reach the flxed-end point will deflne the “envelope” of acceptable living parameters!^. 
Within this envelope it will be possible to study the influence of various parameters, 
including “social” variables such as the male/female ratio (in this respect see Carey 
et al. 2002 and Costa et al. 2010). In this way, it may be possible to test the 
relationship between di and a in an experimental way. That should lead to clearer 
and more accurate results than the observational method that one must use for human 
populations. 

Naturally, the drosophila fruit flies are not the only living organisms for which Gom- 
pertz’s law holds. For instance, it was shown by Raymond Pearl (1941) that the 
lifetables of flour beetles are fairly similar to human life tables. 

Comments on earlier papers 

The fact that the slope of Gompertz’s law is country and period dependent has been 
observed as soon as international demographic data became available (e.g. Statis- 
tique 1907, INSEE 1954). However, the explanations put forward in such studies 
were very different from the one given in the present paper. In a sense this is under¬ 
standable because the validity of Gompertz’s law in the age range 95 — 106 which 
is a key-element in our explanation was clearly established only in 1999 through the 
work of Thatcher and his collaborators. 

In the following lines we briefly discuss two studies: Strehler (1960, 1967) and 
Beltran-Sanchez et al. (2012). A third one. Finch (1990), is mentioned in the refer¬ 
ence section. 

The international data used in Strehler (1960, 1967) are from the UN Demographic 
Yearbook of 1955 and concern 32 countries. Quite surprisingly the authors study 
the regression between a and the “logarithms of extrapolated hypothetical mortality 


^The word “envelope” is used in the same sense as for the envelope of flight parameters of an aircraft. 
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rates at age 0”. Is this not a case of circular reasoning? In the present paper the 
death rates at age 37.5 were taken from death data independently of the regression 
procedure. 


The objective of the paper by Beltran-Sanchez et al. is different. These authors wish 
to determine if there is a connection between infant or child mortality, mid-age mor¬ 
tality and the slope of Gompertz’s law. This is indeed an important question. The 
author show that between 1800 and 1915 there is indeed a connection between early 
life mortality and late life mortality. Why did they limit their study to 1800-1915? 
One can guess that they left aside following decades because during the 20th cen¬ 
tury medical progress brought about a dramatic decline in early life mortality which 
would have made the effect they wanted to study all too obvious. By limiting them¬ 
selves to the period prior to 1915, they have a better chance to observe the biological 
aspect independently of the impact of medical advances. Another way to achieve that 
objective would have been to study this effect on animal populations. This brings us 
back to the proposition that if carried out in parallel with demographic studies, the 
investigation of animal populations can provide valuable additional insight. 
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